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Abstract: Lot X = {Xij : 1 < i, j < n} be an n X n array of independent random 
variables. Let n be a uniform random permutation of {1, 2, . . . , n}, independent of X, 
and let W = J2i=i X i-n(i) ■ Suppose X is standardized so that EW = 0, Var(VK) = 1. We 
prove that the Kolmogorov distance between the distribution of W and the standard 
normal distribution is bounded by 447 521^=1 | 3 /n. Our approach is by Stein's 

method of exchangeable pairs and the use of a concentration inequality. 
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1. Introduction and statement of the main result 

Motivated by permutation tests in non-parametric statistics, Wald and Wolfowitz [20] proved 
a central limit theorem for the combinatorial statistics Y^i=i a i^(i) w hcre {ai,bj : i,j € 
[n] := {1,2,. . . , n}} are real numbers and n is a uniform random permutation of [n]. Their 
result was generalized to real arrays {cy : i,j € [n]} by Hoeffding [14]. Extension to random 
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arrays {X^ : i, j £ [n]} where (Xij) arc independent random variables was considered by Ho 
and Chen [13]. Using the concentration inequality approach in Stein's method, they proved a 
bound on the Kolmogorov distance between the distribution of Yl7=i -^wr(i) an d the normal 
distribution with the same mean and variance. The bound in [13] is optimal only when 
< C for an absolute constant C. A third-moment bound for a combinatorial central 
limit theorem for real arrays {c^ : i, j G [n]} was obtained by Bolthausen [2], who used Stein's 
method and induction. However, the absolute constant in the bound in [2] is not explicit. A 
bound with an explicit constant for real arrays with |cy | < C was obtained by Goldstein [11] 
using Stein's method and zero-bias coupling (see also [6]). Under the same setting as Ho and 
Chen [13], Ncammance and Suntornchost [16] stated a third- moment bound. They used the 
same Stein identity in [13], which dates back to Chen [3], and the concentration inequality 
approach. However, there is an error in the proof in [16], where the first equality and the 
second inequality on page 576 are incorrect because of the dependence among S(t), AS and 
M(t). 

In this paper, we give a different proof of the combinatorial central limit theorem. Our 
result gives a third-moment bound with an explicit constant under the setting of Ho and 
Chen. Our approach is by Stein's method of exchangeable pairs and the use of a concentration 
inequality. The use of an exchangeable pair simplifies the construction of a Stein identity as 
compared to the construction in [13] and [16]. 

Stein's method was introduced by Stein ([18], [19]) and has became a popular tool in 
proving distributional approximation results because of its power in handling dependence 
among random variables. We refer to Barbour and Chen [1] and Chen, Goldstein and Shao 
[6] for an introduction of Stein's method. The notion of exchangeable pair was introduced 
by Stein [19], and first used in Diaconis [10]. The concentration inequality approach was also 
introduced by Stein (see [13]) and was developed by Chen ([4], [5]) and Chen and Shao ([7], 
[8]). This approach provides a smoothing technique and a way of obtaining third-moment 
bounds on the Kolmogorov distance. 

The following is our main result. 

Theorem 1.1. Let X = {X^ : i,j S [n]} be an nx n array of independent random variables 
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where n > l,EJQj = c^, VarJQj = afj. Assume 



"3 



c. = (1.1) 



where Ci. = $3j=i c ij/ n > c j = £j=i c ij/ n > c - = 2ii=i c ij/ n2 - Let it be a uniform random 
permutation of [n], independent o/X, and Zet = X)"=i -^wrf*)- T/ien 

1 n 1 n 

(2) assuming Var(W) = 1, we have 

sup\P(W < z) - $(z)\ < 447 7 (1.3) 

where $ zs i/ie standard normal distribution function and 

1 " 

For real arrays, we have the following corollary of Theorem 1.1 by letting Uij = 0. 
Corollary 1.2. Let {cij : i,j € [n]} &e an n x n reaZ array. Assume 

Ci. = c.j = a. =0 

w/iere = £)"=i cy/n, c.j = 2"=i c y/ n ; c - = £ij=i c ij/ n2 > an( ^ assume 

Lei it be a uniform random permutation of [n], and let W = X)"=i c in(i)- Then we have 

A AH n 

S up\P(W<z)-<S>(z)\<—Y, M 3 - (1-5) 
Moreover, if |cy | < C /or a/Z !,) £ [n] , £/ien 

sup|P(l^ < z) - $(z)| < 447C. (1.6) 

zGE 

We remark that the constant 447 in Corollary 1.2 can be reduced by a modification of 
the proof of Theorem 1.1 for this special case. The error bound in (1.5) was obtained by 

imsart-generic ver. 2011/11/15 file: Comb_CLT_Arxiv.tex date: June 1, 2012 



L.H.Y. Chen et al. /Combinatorial central limit theorem 4 

Bolthausen [2] except that the constant in his bound is not explicit. The error bound in 
(1.6) is of the same order as that in [11] and in Theorem 6.1 of [6], although the constant in 
Theorem 6.1 is smaller than that in (1.6). 

For any n x n array of independent random variables, we can standardize it and apply 
Theorem 1.1 to obtain the following corollary. 

Corollary 1.3. Let Y = {Yy : i, j € [n]} be an nx n array of independent random variables 
where n > l,EYy = jUy, VarY^j = afy Let it be a uniform random permutation of [n], 
independent ofY, and let V = 22I=i ^wr(-t)- Then we have 

sup\P( V ~ ntX - <z)-$(z)\<— V E|Ky — fii. — fx.j + fi..\ 3 /a 3 
z m a n 



whe 



3=1 i=l ij=l 



^ n ^ n 

a 1 = Var(V) = - ^ <t£ + ^—j ^ (Mij ~ Mi- - M-j + H~? 



Pi 3 - /'. - t- /'•• 2 

i,J=l " i,j=l 

and $ is £/ie standard normal distribution function. 



In [21], as a step in the algorithm which generates a random mapping in the Johnson- 
Lindenstrauss lemma [15], the following problem is considered. Let {Yi,...,Y n } be inde- 
pendent mean zero random variables. For a positive integer k < n, let {Yj i; . . . , Y^ k } be 
uniformly chosen from {Y±, . . . ,Y n } without replacement, and let V = X<-t=i ^fr Suppose 
{Yx, . . . , Y n } are standardized so that Y17=i ^^i = fi/k, which is equivalent to Var(U) = 1. 
In [21], it is shown that |V| is well concentrated around E|V|, therefore an estimate of E|V| 
is of interest. Using Stein's method, the bound 

V 7r 2n 

is proved in [21] as a corollary of a general bound on the Wasserstein distance between C(V) 
and N(0, 1). Using the same idea as in the Poisson approximation for the hypergeometric 
distribution in Corollary 3.4 of [3], we let the nx n array Y be such that the first k rows 
are independent copies of {Yi, . . . , Y n } and the other rows are zeros. Then £(^™ =1 Y^U)) = 
£(%2i=i ^?;)- Therefore we have the following result by Corollary 1.3. 

imsart-generic ver. 2011/11/15 file: Comb_CLT_Arxiv.tex date: June 1, 2012 



L.H.Y. Chen et al. /Combinatorial central limit theorem 5 

Corollary 1.4. Let {Y\, . . . , Y n } be independent random variables with EY"i = /i,, Var(Yi) = 
erf. For a positive integer k < n, let {Y^ , . . . , Y^ k } be uniformly chosen from {Yi, . . . , Y n } 
without replacement, and let V = $3»=i ■ we ftowe 

sup\P( V ~ kfl <z)-$(z)\ 

, n , , n , (1. 7 ) 

447 r. v-i„iK, , n — k, ,,3 , , , v-^ , fc , ,,3" 

< — fc^E - Fi-Mi + ii-M + n-fc £ -0*i-# 

ncr 3 L ' 1 n n 1 z — ' 1 n 1 J 

i=i i=i 

where 

P = E /*i/n, - Var(y) = * £ o? + f> - , 

j=i »=i v ' »=i 

and $ is £/ie standard normal distribution function. 

If /ij = for all i e [n], then er 2 = £ 532= 1 °f an d the error bound in (1.7) reduces 

same as the Wasscrstcin distance bound obtained in [211 

except for the constant. 

The case where Oi = for all i <E [n] in Corollary 1.4 was considered by Goldstein [12] 
using zero-bias coupling, where a similar bound with a smaller constant was proved for the 
Wasserstein distance in simple random sampling (see Theorem 5.1 of [12]). Although we 
will not consider Wasserstein distance in this paper, we wish to mention that a Wasserstein 
distance bound can be obtained for the normal approximation for C(W) where W is defined 
in Theorem 1.1. The proof for the bound will not require a concentration inequality and the 
bound will have a smaller constant. Sec Section 2 of [13] for a discussion. 

In the next section, we prove a concentration inequality using exchangeable pairs (Lemma 
2.1) and apply it to random variables with a combinatorial structure similar to that of W 
in Theorem 1.1. In Section 3, we prove the main result Theorem 1.1 by Stein's method of 
exchangeable pairs and the concentration inequality approach. 

2. Concentration inequalities via exchangeable pairs 

The next lemma provides a bound on P(S 6 [a, b]) assuming the existence of an exchangeable 
pair (S, S') and an approximate linearity condition. 
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Lemma 2.1. Suppose (<S, <S") is an exchangeable pair of square integrable random variables 
and satisfies the following approximate linearity condition 

E(S' - S\S) = -\S + R (2.1) 

for a positive number A and a random variable R. Then, for a < b, 

P(Se[a,b}) 

E\S\+E\R\/X b-a 
~ ES 2 -E\SR\/X- 1/2 1 2 ' (2.2) 



ES 2 - E\SR\/X 



where 



6 = ns>-s\* 

A 



provided that ES 2 - E\SR\/X - 1/2 > 0. 

Remark 2.2. If R = in (2.1) and ES 2 = 1, the bound on the right-hand side of (2.2) 
becomes 

b-a + 28 + 2^Var^E(^(S" - S) 2 I(\S' - S\ < 6)\S) 

Bounding the last term involves studying the conditional distribution of (5' — S) 2 given S , 
which is common in the literature of Steins method of exchangeable pairs. An advantage of 
our bound is that it truncates \S' — S\ at 5 thus allowing us to keep within third moments. 

Proof. Assume S < oo without loss of generality. From the exchangeability of S and 5", 

E(S'-S)(f(S') + f(S))=0 (2.4) 

for all / such that the above expectation exists. Therefore, 

E(S' - S)(f(S') - f(S)) = 2E(S - S')f(S). 

Using the approximate linearity condition (2.1) for the right-hand side of the above equation, 
we have for absolutely continuous /, 

rS'-S 



ESf(S) = ^E(S' -S)J f'(S + t)dt + jERf(S). 



(2.5) 
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The identity (2.4) was introduced by Stein [19] and (2.5) was obtained by Stein [19] in the 
case R = and by Rinott and Rotar [17] for R ^ 0. 

Let / be such that f(w) = I(a-S <w < b + S) and /(^) = 0. Therefore, |/| < ^ + 5. 
Using the property that for all w, w' € R, 

pW —w 

(w' -w) f(w + t)dt > 0, 

Jo 

we have 

l.E{S'-S)J f(S + t)dt 

> -^-E(5' - S) [ f'(S + t)dtI(\S' -S\< S)I(S G [a, b}) 
2A Jq 



E/(Se [a,6])^(S'-S) 2 /(|S'-S| <S) 



= EI(S £ [a,b]) 



S( - S) 2 I(\S> -S\<5)\S)- E±(S' S) 2 I(\S> -S\<5) 



+ EI(S G [a, 6])E— (5" - S) 2 I(\S' -S\<S) 
2 a 

:= R\ + i?2' 
Using the Cauchy-Schwarz inequality, 



|i?i| < ^Vbx(e(±(S> - S^JflS' - 5| < 5)|5 
From (2.1), 

E(S" - S f = 2ES{\S -R) = 2XES 2 - 2ESR. 

Therefore, 

R 2 = P{S G [a, 6])E— (S' - S) 2 - P{S G [a, &])E-J-(S" - 5) 2 /(|5' -S\>8) 
2 A Z\ 

> P(S G M])(ES 2 - ^) - P(S G [a> 6])i5!^!! 

= P(5G[a,fe])(E5 2 -^-i) 

>P(5E[a, fo]) (E5 2 -«-i) 

where in the last equality we used the definition of <5 in (2.3). Using the fact that |/| < ^=^+5, 
we have 

|E5/(5)| < ( b -^ + 5)E\S\, \\mf(S)\ < (^ + <5)ffl. 
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The lemma is proved by applying all the above bounds to (2.5). 



8 
□ 



Now we apply Lemma 2.1 to establish a concentration inequality for a sum S which is 
defined as follows. Let X be the n x n array defined in Theorem 1.1 satisfying (1.1) and 

For n > 6. Let X,Jc\ [n] with cardinalities \X\ = \J~\ = m € {2,3,4}. We remove rows X 
and columns J from the original array X. Since the argument in the proof will not depend 
on the choices of X and J given m, we assume the rows and columns removed are the last m 
rows and m columns for simplification of notation. Let r be an independent uniform random 
permutation of [n — m]. Define the variable S by 



n—m 
i=l 



(2.7) 



Proposition 2.3. Let S be defined by (2.7) for some m £ {2,3,4}. Suppose 7 < 1/cq where 
7 was defined in (1.4), and Cq and n > 6 are large enough to satisfy 

2y / n / n 







2n 



24n 



(n - 4)c^ /3 (n - 5)= 



24n 

(n - 5) : 



> 0. 



TTien /or oH a < b, 



where 



P(5e [a, 6]) < Cl (6- a ) + c 2 7 



Cl 



24n 



(n-5) 2 



C2 



64n 



8rj 



16n 



32(^-) 3 



1 r 32n 



n - 4 



(2.8) 

(2.9) 
(2.10) 
(2.11) 



.(n - 4) 2 n — 4 v n - 4' 
To prove Proposition 2.3, we need the following lemma which estimates the second moment 
of S. 

Lemma 2.4. Let S be defined by (2.7) for some m € {2,3,4} and n > 6. Suppose 7 < 1/co 
where 7 was defined in (1.4). Under the assumptions (1.1) and (2.6), we have 
n-l 2n 24n „ „ 9 n 24n 



(n-4)cjf (""5) 2 



< ES" 2 < 



(n - 5) 



2 ' 



(2.12) 
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E - E E - E E + E E 

l<i,j,k,l<n—m l<i—j<n—rn l<.k,l<.n—m l<i,j<n—m l<k—l<-n—m l<i—j<n—m l<k— l<n— m 

and using the assumption (1.1), 
ES 2 

n—tn n—m 

= E (E X ir{i)f = E EX ir(i) + E EX ir(i) X jr(J) 
i—1 i—1 l<i^j<n—m 

- n — m ^ 

= E EX S + 7 v 77 E E EX*** 

n — m f-r* [n — m)[n — m — I ) . f— ' f — ' 



i,j=l l<ij£j<ri—m l<ky^l<.n—m 

n—m 



(2.13) 



(n — m)(n — m— 1) 

= " E ( C ii + + 7 7T~ TT E E Ci/.CjZ 

n - m J 3 (n - m)(n - m - 1) ^ ^ 

i,j=l v /v ; l<i^tj<n-m l<k^l<n-m 

n—m _^ n—m 

— E cr ?7 + 7 E c % 

n — m *■ — ' J n — m — 1 * — ' J 

_^ n—m 

+ 7 77 TT E c y( E c * fc + E c y + E Cfc; )- 

(n — m)(n — ra— 1) ^— ' z — ' z — ' ^— ' 

x l,J=l fe=n— m+1 l—n—m+1 k,l—n—m+l 

Under the assumption (2.6), E5* 2 is close to 1 intuitively. We quantify it as follows. From 
(1.1) and (3.1), 



n—m n n—m n 



I EM E **)l = l£( E <^) 2 l<lE™ E 4l<"»(n-l). 

ijj=l fc— n— m+1 i—1 fc— n— m+1 i—1 fe=n— m+1 

Similarly, 

n—m n 

i E E q jI - m (" ~ 

1 £=n— m+1 

Moreover, 

n—m n n n 

I £ ]T c fe; | = |( ^ c fc; ) 2 | < m 2 J2 °ki ^ ™ 2 (n-l). 

ij — l k,l=n — m+1 fe,Z=n— m+1 fc,i— n— m+1 

Therefore, from (2.13), 

^^,9 1 , 9 9n (2m + m 2 )(n — 1) n 24n 

E5 ^ 7 E K' + 4 + r 77"^ 77 ^ 7 + 7 777- 

n — m—L £ — ' J J in — m)[n — m — 1 n — 5 n — or 

i,j—l 
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Since 7 < 1/co, using Holder's inequality, we have 

n 

E (4 + 4) = E ^ (M 1/3 ( E ei^i 3 ) 273 < Vc! 

l<i,j<n:i>n-m l<«,j<n:i>n-m i,J = l 

op j>n-m op j>n-m 

A lower bound can be obtained as 

24n 



2/3 



ES 



> E (4 + 4) 



n — m ^— ' y 13 fn — 5) 2 

ij"=i ' 

= ^~ ~ m ? ( 4 + 4) - E (4 + 4) - (n _ 5)2 

j,J=l l<i,j<TP:i>n- m V y 

or j>n-m 

n-l 2n 24n 

> 



n-2 (n-4)c 2 /3 (n - 5)*' 

□ 



Proof of Proposition 2.3. For any m <E {2,3,4}, we construct an exchangeable pair (S,S') 
by uniformly selecting two different indices I,j£ [n — m] and letting S' = S — Xj T ^j^ — 
Xjr(j) + x It(J) + x Jt(i)- An approximate linearity condition with an error term can be 
established as 



E(5' - 5|S) 
1 



[n — m)(n — m — 1) 



1 <*,j <n— m 
n—m 



X ir{j) + X jr(i) - X i T (i) + X jr(j) 



(n — m)(n 
= -XS + R 
where A = 2/(n — m — 1) and 



^ 71— TO. N 

z - m -n E 2E^-2(n-m)5|g 

; k t,.7=l J 



s 



(2.14) 



7? 



(n — m)(n — m — 1) 



-E(£ ^ 3 -|5). 



To apply the concentration inequality in Lemma 2.1, we need to: 



1. Bound VE~R2/A. 

2. Bound 



5 = 



E|5'-5| a 



(2.15) 
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3. Bound 



Firstly, 



'Var( E(^ — (S' - S) 2 I{\S' - S\ < S)\S 



< 



(n — m)(n — m — 1) \ 
2 

[n — m)(n — m — 1) \ 
2 



E(E(^Xy|5))2 
i, 3 = l 



Var(E(^Xy|5)) + (E ]T X y -) 2 
«,j=i «')j=i 



(n — m)(n — m — 1) \ 

2 n—rn n 

(n-m)(n-m-l)\ S 4 + ( E ^i) 2 

x ^ i,j=± i,j—n—m-\-l 



< 



< 



< 



(n-m)(n-m-l)\ 
2 

(n — m)(n — m — 1) \ 
2-v/mn 



E 4 + m E 



i : j—n — m+1 



7 / 



™( E 4 + E 4) 



(n — m)(n — to + 1) 
where we used the assumptions (1.1) and (2.6). Therefore, 



X ~ ra-4' 



Next wc bound <5 of (2.15). 
E|5' - 5| 3 



= E 



< E 



< 



1 



(n — m)(n — m — 1) 



E E (l^r( l ) + -Xjvy) - X iT{j) - X jTW | 3 |x) 



1< z',j <n— m 



(2.16) 



16 



(n — m)(n — m — 1) 



]T e((|x itW | 3 + |x Mj) | 3 + |x iT(i) | 3 + |x iT(i) | 3 )|x) 



1<?',j <n— m 



64n 



(n — m)(n — m — 1) 
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where we used the fact that 

\ X ir(j) + X jT(i) - X iT{i ) - X,v(j) I 3 

< 16(|X iT(j )| 3 + |-Xjv(i) | + |^ ir (i)| 3 + I-XjtO')! )• 



Therefore, 



5<^ 7 . (2.18) 

71 — 4 



Now we turn to the final step of bounding ^VarME^^S' - S , ) 2 /(|S ,/ - 5| < tf)^) J. Dc 



note 



a ij — P^r(i) + Xjr(j) ~ X iT{j) — X jr(i)) 2 I(\ X iT(i) + X ]t(j) ~ ^ir(j) — ^jV(i) I < 

We have 

^E((5'-5)»I(|5'-5|<5)|X 1 r) = i ^ E «&• 

l<i^j<n—m 

Therefore, with | • | meaning the cardinality of a set, 



Var E 



l(±.(S'-Sfl(\S'-S\<5)\s) 

<Vbx(^-e((S' - S) 2 I(\S' - S\ < S)\X,t 

= ,r( 1 Y2 i 2 E Var K) 
loin — m r J 

l<i^j<n—m 

+ E C ° V ("y:<' 3 ') 

+ E Cov(c^., <,,.,)} 

l<2 ;< 7,i' j' <n— m, j' |— 4 

The terms i?i and i?2 are easy to bound. 

2 n—m „ e 

" 16(n-mV ^ + - *<t W ) - ^r W | 3 < T^W (2.19) 

Prom Cov(X, Y") < (Var(X)+Var(F))/2, (2.17) and the restriction that i ^ j, i' ^ f, \i,j, 
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3, we have 

< 



16(n — m) 2 



16n<5 

< 7 

- n-4 



E 



l<i,j,i' ,j'<n—m,i^j,i'^j' ,j'\=3 



(2.20) 



where one factor of 64 comes from (2.17) by separating the four summands and the other 
factor 4 comes from the constraint = 3. Let afj = (X^, +Xji — Xu — Xjk) 2 I(\Xik + 

X jX - Xu - X jk \ < S). For = 4, 



Cov(a[., al ,,) = Ea[,aL - Ea[-Ea[, 



1 



(n — m)(n — m — l)(n — m — 2)(n — to — 3) 
1 



E 



(n — to)(ti — to — 1) 

v ' v y l<fc^Z<n-m 



E 

l<k,l,k',l'<n-m,\k,l,k',l'\=4 
1 



E„kl „ k' V 



(n — m)(n — m — 1) 



E 



l<k' y^l' <n—m 



(n — m)(n — in — l)(n — m — 2)(n — m — 3) 



E 



E 



l<k,l,k',l'<n-m,\k,l,k',l'\=4 



(n — m) 2 (n — m — l) 2 



in — m) 2 (n — to — 1) 



Ec^Eag' 



l<k,l,k' ,l'<n-m,\k,l,k' ,!'|=4 



E 



l<fc,i,fc',i'<n-m,fe^,fc'5^',|fc,Z,fc',i'|<3 

4(n — to) — 6 



(n — m) 2 (n — to — l) 2 (n — m — 2) (n — m — 3) 



E 



l<fc,Z,fc',/'<n-m,|fc,i,fc',Z'|=4 



(n — m) 2 (n — m — l) 2 



E 



l<fe,i,fe',i'<n— m,fc^,fc'^i',|fe,/,fe',i'|<3 
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Therefore, 



16(n — m) 5 



E 



1 ,j'<n— m, \i ,j' | —4 



4(n — m) — 6 
(n — m) 2 (n — m — l) 2 (n — m — 2)(n — m — 3) 

E(a«) a +E(c#'') a 



E 

l<k,l,k' ,l'<n-m,\k,l,k' ,i'|=4 



1 



< 



(n — m) 2 (n — m — l) 2 
1 



E 



E(og) a +E(o*j,) a 



16(n — m) 5 



E 

) + 



l<fc,!,k',!'<n-m,fc^i,fc'^Z',|fc,Z,fc',i'|<3 
4 



E(a*<) 2 



( V 

(n — m)(n — m — l) 2 ^— ' 2 



l</c'^Z'<ro— m 



(n — m)(n — m — l) 2 v ^— ■* 2 

v /v ; l<fe^i<„-m 



E 



< 



2(?i-m) 3 (n-m- l) 2 



E 



E E K) 2 



J' < n ~ Tn,\i ,j'\= 4 l<fc^Z<n— m 



< 



2(n - m) 3 ^ 



]T E|X lfe + X j7 - X if - X, fc | 3 < 32(-^) 3 5 7 



(2.21) 



From (2.19), (2.20), (2.21), and then applying (2.18), we obtain 



Var(W^(S" - S^IflS' - S| < 5)|5 



< 



< 



8n I 16n I 32f n I 3 
(n-4) 2 n-4 n-4 ; 



8n 



i^4_ 32(^^3./ 32n 



(2.22) 



(n-4) 2 n — 4 n— 4 V n ■ — 4 
Now we are ready to obtain a concentration inequality for S using Lemma 2.1. From (2.2), 
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and applying the bounds (2.16), (2.12), (2.18), (2.22), we obtain 
P(Se [a,b]) < 



n-5 ' (n-5)' 2 



2yH 

n— 4 



ra-1 
ra-2 



7 /II 2/3 

(n-4)c ' 



24w _ 
(n — 5) 2 7i—4 \/ n— 5 



8ra 
(n-4) 2 



16n i <jo^ n \3 / 32n 
n-4 "f" M Vn-4J V n-4 



n-1 _ 2n 24k 
™-2 (n-4)c^ /3 (^5F n-4 

< ci(b — a) + C27. 



2yrT / n j ~ 24n _ 1 
n-4 y n-5 ' (n-5) 2 2 



5) 



□ 



Remark 2.5. From Proposition 2.3, the error in [16] can be corrected by conditioning on 
(using their notation) 

J,K,L,M,t(J),t(K),t(L),t(M), 
{X l3 : i € {J, if, r _1 (i), r _1 (M)}, j e {£, M, r( J), r(JQ}}, 

and 6?/ applying our Proposition 2.3 instead of their Proposition 2.7. 
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From (1.1), EW = 0. The variance of W can be calculated as follows. From (1.1), 

n 

Var(W) = Var(^X i7rW ) 
i=l 

n 

= Var(X l7r(l) ) + ^ Cov(X l7r(j) , X j7r(j) ) 

i— 1 ^^i^j^n 
n 

= E E ( X ^M - Ci-f + E E ( X i*(i) ~ °i-)( X Mj) ~ Cj.) 
i—1 l<i^j<n 

1 - 1 

= - HXij - c,) 2 + - — - E E ( x * - - 

n n(n — 1) i: — ' 

i,j=l v ; i<i^j<n l<k^l<n 

1 - 1 

= - E + 4) + -? — tt E E CifcC ^ 

n j j n(n — 1) ^-^ •'-^ 

^ n 1 n 

= -E(4+4) + ^3T)E4 



4- 



= 1 E4 + ^rE 

n ^— ' J n - 1 ^ 

This proves the first part of the theorem. In the following, we work under the assumption 
that Vax(W) = 1, i.e., 

£,.7=1 »,j=l 

We assume 7 < 1/447, i.e., Co = 447 in Proposition 2.3. Otherwise the bound (1.3) is 
obviously true. From (3.1) and Holder's inequality, we have 

n n 

n - 1 < E EX ij ^ nV3 ( E E l^f) 2/3 = " 4/3 7 2/3 - (3.2) 

Therefore, wc have n > 199800 and (2.8) is satisfied. We prove Theorem 1.1 by applying 
the concentration inequality (2.9) in Stein's method. Wc follow the notation in Section 1 
and construct an exchangeable pair (VT, W) by uniformly selecting two different indices 
I,JG [n] (the ranges of / and J arc different from those in the proof of Proposition 2.3) and 
let W = W — X l7r (i) — XjnU) + Xj^ij) + Xjk^)- Following the argument as in (2.14), we 
have 

E(W' — W\W) = —XW + R (3.3) 
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The following bound on \/Ei? 2 
2 



< 



»(n-l)\ 



< 



Var( V Xij) = — t 

^ n(n — 1) \ f-r 1 J [n - l)v n 



T,4 



(3.4) 



is obtained by using the assumptions (1.1) and (3.1). 

From the fact that (W, W) is an exchangeable pair and satisfies an approximate linearity 
condition (3.3), the following functional identity can be proved following the same argument 
as in (2.5). 



EWf(W) = - W)(f(W) - f(W)) + Eii/(M/) 



A 



Let / be the bounded solution to the Stein equation 

f'(w) - wf(w) = I(w < z) — ®{z). 
It is known that (Chen and Shao [9]) 

^7T 



\f(w)\<^, |/'H|<1 Vwe 



and 



2tt 

\(w + u)f(w + u)-{w + v)f(w +v)\< (\w\ + ^— )(\ u \ + \v\). 



(3.5) 



(3.6) 



(3.7) 



(3.8) 



From (3.6) and (3.5), what we need to bound is 

F(W <z)- $(2) 
= Ef'{W) - EWf(W) 



= E/'(W)(1 - 
A 



(W - Wf 
2A 



— E(W - W) 



w'-w 



(f'(W)-f(W + t))dt 



■= Ri + R2 — R3 
From (3.4) and (3.7), and recalling A = 2/(n — 1), we have 

\R 3 \ < 1/Vn. (3.9) 
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To bound R\ and R2 , we need the concentration inequality obtained in the last section. 
From (3.3) and (3.4), 



E(W - Wf = 2A - 2EWR < 



<2A + 2VEi^<^_(l + ^). 



(3.10) 



> 2A - 2\ iH 1 > -Ml - -4=). 



We bound R2 first. From (3.6), 



r 2 = — :(ir' - wo 



w'-w 



(Wf(W) -(W + t)f(W + t))dt 

— ^2 ^( X iir(j) + X jir(i) ~ X iit{i) — X jn(j)) 



4ii 



l<i^j<n 



I(U < z — X iw ^ - Xjntj)) 



I(U < Z — X i7r ^) — X jv(J) ~ *) 



:= #2,1 + #2,2 

where U = J2i<k<n;k&,j x kn{k) • Noting that (7 is independent of {Xj^X^^X^y^X^^)} 
given 7r(i),7r(j), and that the conditional distribution of U given 7r(i),7r(j) is the same as 
the distribution of S in (2.7) (except for the indices but this does not matter), we can apply 
the concentration inequality (2.9) to obtain the following upper bound on | J?2,2 1 ■ 



jir(i) X iir(i) ^jf(j) , 

\<i^j<n 
X i7r (j)+X j7r(i) -X i7r(i) -X j7r(f) 

I(z - X in(i) - X Mj) -{tV0)<U <z- X„ (i) - X Mj) — (t A 0))dt 



X 3*U)) 



-4^ ^2 E ( X ™U) + X Mi) ~ X iir(i) - X jTr(j)) 



l<i^j<n 
X i7r (j)+X j7r(i) -X i7 , {i) -X^ {j) 







(c 1 \t\+c 2 ~f)dt 



£T X! E I^T(i) + X Mi) ~ X in(i) - X jir(j) I 3 
l<ijtj<n 



4ii 



1 



< 8ci7 + c 2 7(l + -7=). 
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In the last inequality, we used (2.17) and 

nx Mj) + X Ml) - - X Mj) | 2 = n(n - l)E(W - W) 2 

l<2^j<n 

< 4n(l+~L) 
V n 

where (3.10) is used in the last inequality. For i?2,i, from the property (3.8) of / with 

W = U,U = X l7r (i) + Xj n (j),V = X l7r ( t ) + Xfrfj) + t, 



X i*U) ~ X i-K(i) - X ^(j) 



II 



l<i^_;<7i 



<24(1 



24ti 



n — 5 (n - 5) 2 4 
where we used (2.12), (2.17) and 



2tt 

(\U\ + ^){2\X lA;i) +X, M]) \+t)dt 

(XiTr(j) + ~ Xi v (i) — Xj 7r (j)) 2 2|Xj 7r ( i ) + Xj„(j)\ 

\ X in(j) + X i*{i) ~ X^a) - Xj^tj) | 3 

2 

)7 



\XiTt(J) + Xjn(i) — Xi^^i) — Xj % (J} | 2 |-X"i7T(i) + Xj„fj} I 

< -^{\ X m(J)? + I^Jir( l )| 3 ) + yd^Ml 3 + I^JTrO')! 3 )- 



Therefore, with 



24n 



C3 := 



n — 5 (n — 5) 2 



| J R 2 |<(8c 1 +c 2 (l 



24c 3 + 6V2tt)7 



(3.11) 
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Next, we bound R\. 

Ri 

= Ef(W)(l- {W/ 2X W)2 ) 



n 2 (n — l) 2 

E E ( 

1 



(Xu + Xjk — x,ik — XjiY 



2A 



)\l = i,J = j, n(i) = k, 7r(j) = I 



n 2 (n — l) 2 



E E (/wa 



l<i,j,k,l<n,i^j ,k^l 



(Xu + Xjfc — X,fc — XjiY 

2A 



)|tt(i) = fc,7r(j) = £J 



since X, 7r and (I, J) are independent. For each choice of i ^ j, k ^ let X u := {X^, 

G [n]} be the same as X except that {Xi k , Xu, Xj k , Xji} has been replaced by an 
independent copy {X' ik , X' u , X' jk , X'^}. Define 



W ijki = x mi 

/ ^ vtt{v) 



Then, 



W 13 " 1 is independent of {X lk , X a , X jk , X }1 } and C{W l3kl ) = C{W). (3.12) 
Next, we define a new permutation -k^m coupled with 7r such that 

C(Tr ijk i) = C(ir\ir(i) = k,ir(j) = I). 

This coupling has been constructed by Chen [3] and also by Goldstein [11]. We describe this 
coupling as follows. 

n(a), a^i,j,TT- 1 (k),TT- 1 (l) 
k, a = i 
TTijki{a) = { I, a — j 

ir(i), a = 7r _1 (fc) 
a = -K~ 1 (l). 



Let 
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Since Wijki has the same distribution as the conditional distribution of W given ir(i) 
= I, and since X and 7r are independent, we have 



E 



(f(W)(l = k, = l) 



= Ef'(W m )( 



^ {Xu + Xjk — Xik — Xji 



2A 



Therefore, 

Rx 



=./ m i-t5) 



2A 



1 



I)) 2 ^ 1 



(i - (X " + Xjfc 2A ^ fc Xjl)2 )(f'(w ijkl ) - f(wv kl )) 



1 



Ffl - (— + - Xlk - X]l) } f'(W ijkl ) 
(n(n-l)) 2 ^ 1 2A jJ { >' 

l<i,j,k,l<n,i^j,kjtl 



E 



Define index sets X = {i,j,Tr- 1 (k),Tr' 1 (l)} and J = {k,l,n(i),n(J)}. Then \2\ = \J\ G 
{2,3,4}. Letting 5 = J2i>$x x i'n(i')i we can write 

Wjjfcj = 5 + ^2 X i'n ijkl (i>) 



i'ex 



and 



w nki = s + ^p x m 

i'ex 

Since S is a function depending only on the components of X outside the square X X S, 

S is independent of {X ih X jk , X lk , X jU ^ Xi,„.. kl ^, ^ X vk(v)} 

i'ex i'ex (3.13) 

given 7r _1 (fc), 7r _1 (/), 7r(i), 7r(j). 

The conditional distribution of S given 7r _1 (fc), 7r _1 (Z), 7r(i), 7r(j) is the same as the dis- 
tribution of 5 in (2.7) (except for the indices but this docs not matter). From (2.12), 
E(|S'||7r- 1 (fc),7r- 1 (0,7r(i),7r(j)) < c 3 . From (3.12), (3.7) and (3.10), 
1 



(n(n — l)) 2 
1 



E 



E(l - ( Xu+X 3 k Xik X i 1 ^ }fi(yyijkl} 



(n(n-l))' 



Ef'(W) Yl 



E( 



l<i,j,k,l<n,ij£j,kj£l 



■IX 

2A 



E/'(W)E(1 - 



fly/ - Wf 
2A 



< 



1 
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Therefore, 
l*i| 

< It^w £ ni - iXil+Xjk -* ik - Xjl)2 )(nw ijkl ) - nw^))\ 

^ ^ '' l<i,j,kd<n.i^j,k^l 

1 

_i 1 \ ~« 1B ,/ 1 PQf + -Xjfc — ^ife — Xji) 2 



'(n(n-l)) 2 ^ v 2A 

l<i,j,k,l<n,ijtj,kytl 

i'ei i'ex v 

< + R± a H — ^= 

where using the Stein equation (3.6), 

*((s+y: x^ iMil)) f{s + e x^ iMV) ) -(s+y: xt^ms + e *s )) i - 

i'EX i' £X z' EX i'GX 

and 

l<i,j,k,l<n,i^j,k^l 

x (7(5 + E < *) - J(S + E < *))|- 

Applying (3.8), (3.13), (2.12) and (3.1), Ri t i can be bounded as follows. 

< -, , 1 n , 2 E E(E(|5|| 7 r- 1 (A ; ) ) 7r- 1 (0,7r(i) )7 r(i)) + ^) 

(n(n — 1)) ' 4 

l<i,j,k,l<n,i^j,k^l 



x e{(| E w i + 1 E - 



i'6l i'Gl 



{ 8(c 3 + ^2^/4) „, , , V^n-l ] 

<max V ; 7 ' E E , 32 c 3 + ^— 7^ 

n z 4 n 

v 1 ' 

< m ffi 8 c 3 + 1 7 -)- r , 32 c 3 + ^— 7 



where we used 



1 1 ( X H + X jk - X ik - Xji) 2 , < 1 (jQj + Xjfc - X ik - Xji) 2 ^ ^ 

2 A 2 A 
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Now we bound R\ 2- 



1 



l<i,j,k,l<n,i^j,k^l k 

x (7(5+ 5] JCi^ w(i0 <z)-/(5+X) <,) < z))|7r- 1 (fc),7r- 1 (0 ) 7r(i),7r(;)] }| 

< 1 V e/e[|1- ( x « + x ik - X ik - Xjt) 2 

- (n(n-l)) 2 ^ \ L 1 2A 1 

x J( z - max{]T X^^), £ < 5 < z - min{£ ^ mW , £ 

|tt- :l (A;), tt- 1 (I), 7r(i), ttO?) 

Recall (3.13) and the fact that the conditional distribution of S given 7r _1 (fc), 7r — 1 (Z) , 7r(i), 7r(j) 
is the same as the distribution of S in (2.7) (except for the indices but this does not mat- 
ter). We can therefore apply the concentration inequality (2.9) to obtain the following upper 
bound on R\2- 



l<i,j,k,l<n,i^j 7 k^l 



\2 



W»»-1)) 2 K . 

x{ci(iE x ^(^)i + iE z Soi)+ c ^} 

i' 61 i' 61 

, 8 n - 1 i . 1 . 
<cimax{-p, 32 7)+c 2 (l + ^)7 



where we used (3.14) and (3.10). Therefore, 



|i?i|<( Cl + C3 + ^)max{4=, 32— 7 } + C2 (l + 4=h+4=- (3.15) 

4 \f Ti Tl yj ft yj ft 

Summing (3.15), (3.11), (3.9) yields an upper bound on sup zSR \P(W < z) — $(z)| as 
sup|P(W < z) - $(z)| 

/I \ (3 ' 16) 

< 40ci + 2(1 + —)c 2 + 14^27 +56c 3 + 2( -f' 2 7 

V v n n—l J 

where we used (3.2). Recall c = 447, n > 199800. Using c = 447 and n = 199800, the 

upper bound in (3.16) is calculated to be smaller than 4477. Since ci,C2 and C3 decrease as 

n increases, (1.3) holds for n > 199800. For n < 199800, 7 > 1/447 and (1.3) holds trivially. 

This completes the proof of Theorem 1.1. 
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